Removal of interfering strokes in double-sided document images

Authors

  • Chew Lim Tan
  • Ruini Cao
  • Peiyi Shen
  • Qian Wang
  • Julia Chee
  • Josephine Chang
Abstract

This paper addresses a problem specific to historical document images: handwritten characters from the reverse side show through as noise on the front side and can even interfere with the front-side characters. A novel method for extracting clear textual images from areas where interfering text overlaps the foreground text is presented. The algorithm rests on the observation that the edges of strokes seeping through from the reverse side are not as sharp as those of strokes on the front side, so it uses edge detection to suppress the unwanted background patterns. Long, strong noisy edges that survive this step are then removed with an orientation filter, which exploits the orientation of the strokes, and a size filter. The method performs well regardless of the intensity difference between the foreground writing and the interfering strokes. Segmentation results on real images are shown and evaluated.
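The edge-sharpness idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the Sobel operator, the fixed threshold, the 8-connected component rule, and the names `sobel_magnitude`, `strong_edges`, and `size_filter` are all assumptions made for illustration, and the orientation filter is left out because the abstract does not specify it.

```python
from collections import deque

def sobel_magnitude(img):
    """Sobel gradient magnitude of a grayscale image given as a list of rows.
    Border pixels are left at 0. (Sobel is an assumed choice of edge detector.)"""
    h, w = len(img), len(img[0])
    mag = [[0.0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = (img[y-1][x+1] + 2*img[y][x+1] + img[y+1][x+1]
                  - img[y-1][x-1] - 2*img[y][x-1] - img[y+1][x-1])
            gy = (img[y+1][x-1] + 2*img[y+1][x] + img[y+1][x+1]
                  - img[y-1][x-1] - 2*img[y-1][x] - img[y-1][x+1])
            mag[y][x] = (gx*gx + gy*gy) ** 0.5
    return mag

def strong_edges(img, threshold):
    """Keep only pixels whose gradient magnitude reaches the (assumed) threshold."""
    return [[m >= threshold for m in row] for row in sobel_magnitude(img)]

def size_filter(mask, min_size):
    """Remove 8-connected components that contain fewer than min_size pixels."""
    h, w = len(mask), len(mask[0])
    keep = [[False] * w for _ in range(h)]
    seen = [[False] * w for _ in range(h)]
    for sy in range(h):
        for sx in range(w):
            if not mask[sy][sx] or seen[sy][sx]:
                continue
            component, queue = [], deque([(sy, sx)])
            seen[sy][sx] = True
            while queue:  # breadth-first search over one component
                y, x = queue.popleft()
                component.append((y, x))
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and mask[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            if len(component) >= min_size:
                for y, x in component:
                    keep[y][x] = True
    return keep

# Synthetic example: a sharp stroke at column 3 and an equally dark but
# blurred stroke centred on column 10 (standing in for seeped ink).
row = [200, 200, 200, 0, 200, 200, 200, 150, 100, 50, 0, 50, 100, 150, 200, 200]
img = [row[:] for _ in range(8)]
cleaned = size_filter(strong_edges(img, 600), 4)
print(sorted({x for r in cleaned for x, v in enumerate(r) if v}))  # [2, 4]
```

In this toy image both strokes reach the same darkest value, yet only the two edges flanking the sharp stroke survive, which mirrors the abstract's claim that the approach does not depend on intensity differences. The threshold of 600 and minimum size of 4 are tuned to the synthetic data only.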

Similar resources

Directional Wavelet Approach to Remove Document Image Interference

In this paper, we propose a directional wavelet approach to remove images of interfering strokes coming from the back of a historical handwritten document due to seeping of ink during long periods of storage. Our previous work required mapping of both sides of the document in order to identify the interfering strokes to be eliminated. Perfect mapping, however, is difficult due to document skews,...

A wavelet approach to double-sided document image pair processing

In this paper, we present a novel method for processing double-sided historic handwritten documents using wavelets. The method is specially designed to remove the interfering strokes from the reverse side caused by ink seeping through pages after long periods of storage. The proposed method works by first matching both sides of a document page such that the interfering strokes are mapped with the ...

Matching of Double-Sided Document Images to Remove Interference

The National Archives of Singapore keeps a large volume of historical handwritten documents. One common problem with the archives is that, over the years, ink has seeped through the pages of these documents such that characters on the reverse side become visible and interfere with the characters on the front side. This paper addresses this problem and develops a novel algorithm to extract clear text...

Segmentation and Analysis of Double-Sided Handwritten Archival Documents

Historical handwritten documents are preserved in good condition in many national archives or libraries. One problem that many archivists face is the seeping of ink through the pages of certain double-sided handwritten documents after long periods of storage. This paper addresses this problem and develops a novel algorithm to extract clear textual images from interfering and overlapping a...

Character Extraction from Interfering Background - Analysis of Double-Sided Handwritten Archival Documents

The seeping of ink through the pages of certain double-sided handwritten documents after long periods of storage poses a serious problem to human readers or OCR systems. This paper addresses this problem through the recovery of content on the front side of a page from the interfering image caused by the handwriting on the reverse side. First, by adapting the Gaussian stochastic model, the inter...

Publication date: 2000